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1. INTRODUCTION 

Dissolved Air pollution has been the main problem to be concerned in a developed and developing 
countries [1-3]. Ozone layer is an optional photochemical toxin created from an assortment of common and 
anthropogenic antecedents that incorporate mechanical and vehicular emanation of unstable natural mixes 
and oxides of nitrogen [4]. Ozone is an imperative constituent of the air that keeps up the warm structure of 
the stratosphere and the troposphere [5]. Numerous studies focused on prediction ozone concentrations [6]. 
These proposed models are not accurate enough. Many researchers focusing now on using Artificial 
Intelligence techniques such as support vector machine in replacing the conventional method in prediction 
[7-12]. Therefore, the main objective of this study is to investigate the capability of SVM in predicting Ozone 
in three different locations in Malaysia. Hyper parameters optimization will be investigated to improve the 
accuracy of the proposed model. Then, finally sensitivity analaysis will be introduce to test the importance of 
each input on the proposed model. 
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2. METHODOLOGY 
2.1. Study Area and Data 

Nine air quality monitoring stations were selected for this study as shown in Table 1 namely; Klang 
(S1), Petaling Jaya (S2), and Cheras (S3). Cheras is suburb of Kuala Lumpur, the capital city of Malaysia. 
The township is located to the south-east of Kuala Lumpur. The air monitoring station at Sekolah Menengah 
Kebangsaan Sri Permaisuri with coordinate at (3°06'25.2"N 101°43'03.9"E).Petaling Jaya, Selangor is 
geographically located at coordinate (3°08'16.7"N 101°36'29.7"E) is located within Klang Valley region and 
covers an area of 97.2 km’. Its location near to Kuala Lumpur’s city centre, surrounded by residential, 
commercial and industrial areas leads to a high volume of traffic. The monitoring station for Petaling Jaya 
was located at Sekolah Rendah Bandar Utama, Petaling Jaya with coordinate (3°08'16.7"N 101°36'29.7"E). 

In order to conduct the prediction ozone layer concentration a three set of data were collected from 
the Department of Environmental Malaysia. Table 2 represent the minimum, maximum, average and 
correlation of the date set of Cheras. In this data, it contains 7 parameters which are Wind Speed hourly, 
humidity hourly, NOx Hourly average, So2 hourly average, No2 hourly average, O; hourly average and Co 
hourly average. 


Table 1. Air Quality Monitoring Stations Description 








Air Monitoring ecation Back pround Coordinates 
Station Latitude Longitude 
Sl Klang Industrial 3°6.'20.0"N 101°24'48.4"E 
$2 Petaling Jaya, Selangor Industrial 3°08'16.7"N 101°36'29.7"E 
S3 Cheras, KL Urban 3°06'25.2"N 101°43'03.9"E 





Table 2. Analysis of Parameters 








Parameters Min Max Avg R 
Ws hourly avg 0.8 1.5 1.15 0.716394 
Humidity hourly avg 100 100 100 -0.74198 
NOx hourly avg 0.031 0.033 0.032 -0.51264 
So2 hourly avg 0.002 0.004 0.003 0.17301 
No2 hourly avg 0.024 0.028 0.026 -0.32457 
O3 hourly avg 0.002 0.014 0.008 1 
Co hourly avg 0.57 0.81 0.69 -0.47499 





2.2. Regression in Support Machines 

An SVM comprises network architectures similar to that of ANNs, which have been pruned in order 
to acquire model simplicity or enhance generalisation. Nevertheless, the approaches for determining network 
architectures regarding either model are dissimilar. Determining suitable ANN architectures typically 
requires manual procedures involving trial and error that mostly depends upon the past experiences and 
preferences of users, with derived weights that cannot be interpreted [13-15]. On the other hand, SVM 
network architecture is analytically confirmed through SVM algorithms, with optimised support vector 
networks obtained as a result. 


3. RESULTS AND DISCUSSION 
3.1. Parameter and Characteristic of SVM 

To discover the proposed model for the study, we had to go through SVM to find out the best kernel 
function for the study. Examples of kernel functions are Radial Basis Function, linear function, polynomial, 
and sigmoid function. The main reason of the findings is to come out with the best kernel functions that could 
produce an excellent result. Besides that, digit parceling preparing information is aligned by applying the 
SVMs with various portions to make the last models engineering. Table 3 presented a similar examination for 
expectation outcome of the S total SVM display utilizing four kernel functions. It could be watched that 
concerning connection coefficient values (R2) while utilizing the RBF bit, the expectation exactness of the 
test session information ended up being the best with around 0.86, trailed by polynomial (0.853), the linear 
kernel (0.77), and the sigmoid kernel (0.062). 

In real scenario, finding out for a good SVM model probably the first things to be done for the best 
structure of SVM model for specific application, there are two indispensable parameters that expected to be 
chosen in particular; limit parameter C and ¢ [16-18]. The determination of C is exceptionally delicate to the 
exactness of the forecast, as the little estimation of C could tend the model to under gauge the objective 
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esteem amid the preparation information, this is because of the way that utilizing generally little weight as a 
part of the preparation information would mirror a bigger estimation of the indicator while inspecting the 
model in the testing dataset and vice-versa. On the highest point of that, when C is huge, the weight will lose 
its centrality on recognizing the mapping between the info and the yield. On the other hand, the huge 
estimation of C could mirror an extensive variety of support vector's qualities; in like manner, supplementary 
information records could be decided for enhancing the support vectors. Moreover, substantial estimation of 
= could tend for less number of the support vector to be accomplished and afterward the normal 
representation of the proposed clarification is deficient. Moreover, too substantial estimation of ¢ could 
prompt to disintegration of the level of exactness amid the preparation. In this specific situation, the choice of 
the ideal estimations of both C and ¢ ought to be accomplished through a few experimentation systems. 
Once the ideal estimations of these two parameters are recognized, there is a high potential to accomplish 
abnormal state of precision for foreseeing the fancied information. 


Table 3. Outcome from each Kernel Function 








Kernel Function ae MEE os - 
Training Test Training Test 
Linear 0.000 0.000 0.770 0.768 
Polynomial 0.000 0.000 0.839 0.838 
RBF 0.000 0.000 0.853 0.854 
SIGMOID 0.002 0.002 0.062 0.036 





Current review, motivation behind deciding proper estimations for parameter C and ¢, the replication 
forms a few parameters landscapes likewise observed the highlights for SVM S total by way of real stride to 
conduct expectation display. Toward the starting, a thought of consistent estimation of ¢ to be 0.1, 
and variable estimations of C to be gone somewhere around 0 and 10 keeping in mind the end goal to 
assemble the proposed show amid the instructional course of the info yield information [19-21]. Thusly, 
a computation of the expectation mistake as RMSE and R2 is completed with recognizing the quantity of the 
quantity of the support vectors. It could be portrayed in Figure 1 that slight decrease in the quantity of the 
support vectors and the estimation of estimation of RMSE are accomplished while the utilized estimation of 
C expanded, then again, the estimation of the R2 increments. Moreover, with concentrating on the parameter 
C, it could be watched that the most minimal estimation of RMSE point (0.816088231) and one high 
connection coefficient esteem (0.920869155). Correlation Coefficient pattern increase on point then it drops 
and increase. Subsequently, it is ideal to choose parameter C to be 2.0. 








(c) (d) 


Figure |. (a) Capacity Vs Correlation coefficient (b) Gamma VS RMSE (c) Epsilon- RBF Model 
(d) Nu-RBF Model 
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Really, it is important to make sense of the suitable estimations of the hyper parameters C and y as a 
noteworthy stride while actualizing any SVM show [22-24]. In this way, there is a need to complete 
experimentation methodology. In this unique situation, estimation of the summed-up exactness using 
different estimation of part hyper has been finished. With deference, the impediment of parameter y, it was 
chosen to hunt its ideal esteem to be inside the scopes of [0.20, 2.0] at augmentation of 0.2 for y with being 
altered to each C=2 and e=0.2. Thusly, the ideal determination estimation of y is discovered utilizing 10-fold 
cross-approval with dull ten times keeping in mind the end goal to upgrade the unwavering quality of the 
model results. In the instructional course, the last engineering of the model in the opposition, when such 
parameters may deliver the base mistakes amid the approval session. Figure 1(b) demonstrates the 
relationship between RMSE and y, where it started in the estimation of relationship coefficient ascend with 
expanding y until it achieves its pinnacle esteem at 0.4 and after that esteem begins to stay continues 
alongside in the quantity of support vectors. The estimation of parameters that gives the base speculation 
blunder is then chosen. The best result for the S total display in preparing and gauging stage while selecting 
y=2.0 with a worthy estimation of number of support vectors 622. 

For the most part, selecting the ideal number of parameters contributions of a specific SVM model is 
fundamental stride; notwithstanding, cutting-edge it is hard to discover certain hypothesis that could be 
utilized to chaperon accomplishing this progression. While playing out the preparation and testing session of 
SVM model, a similar info course of action of the information set of stone segment parameters that were 
utilized have been said as a part of the past areas of this paper. Indeed, looking for the model parameters 
assumes a vital part in accomplishing a decent execution for SVM. Set the hyper parameters C, y and the 
portion parameters (Epsilon and Nu) is considered as indispensable stride in impacting on the SVM 
speculation execution (estimation exactness) [25]. This segment will be centred in the utilization of the two 
sorts of RBF piece for its great execution and points of interest in ozone concentration determining issue. 
The accompanying clarifies improved and chose part parameter values for each of the two models. 

Epsilon-Radial Basis Function display cast-off to produce a foreseeing S total, here settle C = 2 and 
y=2.0, customary « as different values somewhere around 0.01 to 0.9. Outcomes execution for the perfect in 
preparing also testing appeared in Figure 1(c), we can watched that the RMSE increment with expanding the 
estimation of ¢, until e=0.3 while each of the R2 and the quantity of support vectors that are diminished. For 
the last the ¢ esteem (0.2) that yields the base speculation mistake with worthy number of support vector 
(622) is then, picked. 

In performing runs with this model, we utilized ideal qualities in every specific gamma= 2.0, 
limit =6 parameters with an output. The model expands, as forecast precision of the model run information 
builds bit by bit to the most noteworthy esteem (when Nu=0.6) and afterward the estimation of the rest of the 
abatement as appeared Figure 1(d) additionally demonstrates the number identified with support vectors 
increment with the expansion in Nu esteem. 

Previous method of 10 - fold validation been used in trial and error method. Also, method of time 
series regression been tried. As the final testing, V-Fold validation been tested. The cross-approval 
development remains as one of broadly utilize strategies in assessing the values. With a specific end goal to 
utilize the approval, preparation of a set of data randomly part in a usual number for V-folds. At that point the 
chose sort of SVM model is performed successively to the perceptions that occurred to the V-1 folds. 
Methodology could achieve the eventual outcomes of the acting outline which illustration or wrinkle that was 
hidden while setting up the SVM appear; i.e., this is the trying example) keeping in mind the end goal to 
make sense of the mistake characterized by one of the measurable record. The real favourable position of this 
procedure is that the normal precision for the v times could prompt to steady quantify display mistake on the 
dependability.For an instance the lawfulness for testing session of subtle data with the model itself. 
Statistical evaluation total using 8, 12 and 16 as shown in Table 4. 

Table 4 outlines that the MAE and RMSE values accomplished using 8-crease cross-approval 
demonstrate best goodness fitting and extraordinary execution if looked at while using 12 overlay cross- 
approval. Moreover, the results obtain were for MAPE values. It appears to be basic for the decision to speak 
to for the component for both the preparation the model and testing session. 

Besides that, using Support Vector Machine this research also has been conducted using time series 
regression. The outcome from the run be identified based on the results. A value from the multilayer 
prediction model 6-9-1 presents the graph below. From the graph, we could determine that correlation way 
farer than the support vector machine. The value of the RMSE directly gave impact to the prediction 
outcome. 

A final touch is given to the whole research. Based on the findings, Support Vector Machine runs 
with the data of the other two places. The both data produce few values as below. The run was conducted by 
applying Nu = 0.6, Capacity = 2.0 and Gamma = 2.0. The model outcome considers the best. As we see the 
nearest value towards | is the best in the support vector machine. 
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Table 4. Statistical Evaluation total using 8, 12 and 16, V-fold Cross-Validation for Epsilon -Radial basis 
Function and Nu — Radial basis Function Models 
V- Fold 8 12 16 
Nu cc 0.719 0.854 0.84 
MAE 0.00524 0.01309 0.002190 
RMSE 0.84794 0.92412 0.91761 





Epsilon CC 0.843 0.523 0.758 
MAE 0.00748 0.01069 0.00567 
RMSE 0.91815 0.72319 0.87063 





Sensitivity analysis in information mining and measurable model building/fitting for the most part 
alludes to the evaluation of the significance of indicators in the particular (fitted) models. So, given a fitted 
model with certain model parameters for every indicator, what the impact would be of differing the 
parameters of the model (for every variable) on the general model fit. 

In Statistical Data Miner, affectability examination is accessible through a few alternatives; 
the specific insights and measures that will be accounted for will subject to the factual or information digging 
strategy for which the affectability investigation is requested. In Statistical Automated Neural Networks, 
the program will process the Sums of Squares residuals or misclassification rates for the model when the 
individual indicator is dispensed with from the neural net; proportions (of the lessened model versus the full 
model) are likewise reported, and the indicators (in the outcomes table) can be sorted by their significance or 
pertinence for the specific neural net. 

Sensitivity analysis is carried out by using two data mining which are Support vector machine and 
multilayer prediction. Support vector machine is tested by running with few parameters. Initially, the model 
is tested by using 3 parameters which are wind speed, humidity and ozone concentration. In the next model, 
4 parameters are used which are wind speed, humidity, NOx and ozone concentration. Lastly, 6 parameters 
which are wind speed, humidity, NOx, So2, No2 and ozone concentration are used to run the model. 
The values obtain from the run was 0.91214, 0.793095, 0.911043. The patterns of the values increase at 3 
parameters then drop when using 4 parameters and increase gradually when using 6 parameters as shown in 
Table 5. 

Besides that, the same model to be run by each parameter is tested one by one. The dependent 
variable of the run is Ozone Concentration (O3). The results are tabulated as per below. The results also show 
that the best value of was 0.889944. These results obtain from the run between humidity hourly average and 
ozone concentration hourly. The RMSE value of this particular parameter produces better results than 
another parameter. This could be the one of the main factors that disturbs the reading of ozone concentration. 
Statistical evaluation total using one to one parameter as shown in Table 6. 

















Table 5. Statistical Evaluation total using Table 6. Statistical Evaluation total using 
3, 4 and 6 Parameters One to One Parameter 
No of SVM RMSE Parameters Parameters R RMSE 
982 0.91214 3 Ws hourly avg 0.736 0.857904 
958 0.793095 4 Humidity hourly avg 0.792 0.889944 
1238 0.911043 6 NOx hourly avg 0.587 0.766159 
So2 hourly avg 0.189 0.434741 
No2 hourly avg 0.277 0.526308 
Co hourly avg 0.485 0.696419 





4. CONCLUSION 

The proposed model has affirmed its capability in predicting ozone concentration with high level of 
accuracy. Two types of SVM have been investigated in this study. Nu-SVM gives a RMSE value equal to 
0.92412 while epsilon- SVM gives a value of 0.91815 which way lesser than Nu- SVM. 
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